UCD : Upper Con dence bound for rooted Directed acyclic graphs

نویسندگان

  • Abdallah Saffidine
  • Tristan Cazenave
  • Jean Méhat
چکیده

In this paper we present a framework for testing various algorithms that deal with transpositions in Monte-Carlo Tree Search (MCTS). When using transpositions in MCTS, a Direct Acyclic Graph (DAG) is progressively developed instead of a tree. There are multiple ways to handle the exploration exploitation dilemma when dealing with transpositions. We propose parameterized ways to compute the mean of the child, the playouts of the parent and the playouts of the child. We test the resulting algorithms on several games. For all games, original configurations of our algorithms improve on state of the art algorithms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Volume Requirements of 3D Upward Drawings

This paper studies the problem of drawing directed acyclic graphs in three dimensions in the straight-line grid model, and so that all directed edges are oriented in a common (upward) direction. We show that there exists a family of outerplanar directed acyclic graphs whose volume requirement is super-linear. We also prove that for the special case of rooted trees a linear volume upper bound is...

متن کامل

An O(n) Approximation and Integrality Gap for Disjoint Paths and Unsplittable Flow in Undirected Graphs and DAGs

We consider the maximization version of the edge-disjoint path problem (EDP). In undirected graphs and directed acyclic graphs, we obtain an O( √ n) upper bound on the approximation ratio where n is the number of nodes in the graph. We show this by establishing the upper bound on the integrality gap of the natural relaxation based on multicommodity flows. Our upper bound matches within a consta...

متن کامل

n) approximation and integrality gap for EDP and UFP in undirected graphs and DAGs

We consider the maximization version of the edge disjoint path problem (EDP). In undirected graphs and directed acyclic graphs, we obtain anO( √ n) upper bound on the approximation ratio where n is the number of nodes in the graph. We show this by establishing the upper bound on the integrality gap of the natural multicommodity flow based relaxation. Our upper bound matches to within a constant...

متن کامل

An O(sqrt(n)) Approximation and Integrality Gap for Disjoint Paths and Unsplittable Flow

We consider the maximization version of the edge-disjoint path problem (EDP). In undirected graphs and directed acyclic graphs, we obtain an O( √ n) upper bound on the approximation ratio where n is the number of nodes in the graph. We show this by establishing the upper bound on the integrality gap of the natural relaxation based on multicommodity flows. Our upper bound matches within a consta...

متن کامل

Game chromatic number of graphs

y Abstract We show that if a graph has acyclic chromatic number k, then its game chromatic number is at most k(k + 1). By applying the known upper bounds for the acyclic chromatic numbers of various classes of graphs, we obtain upper bounds for the game chromatic number of these classes of graphs. In particular, since a planar graph has acyclic chromatic number at most 5, we conclude that the g...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011